From Chunks to Function-Argument Structure: A Similarity-Based Approach

Authors

  • Sandra Kübler
  • Erhard W. Hinrichs
Abstract

Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. Such larger structures are not only desirable for a deeper syntactic analysis; they also constitute a necessary prerequisite for assigning function-argument structure. The present paper offers a similarity-based algorithm for assigning functional labels such as subject, object, head, and complement to complete syntactic structures on the basis of pre-chunked input. The evaluation of the algorithm concentrated on measuring the quality of functional labels. It was performed on a German and an English treebank, using two different annotation schemes at the level of function-argument structure. The results of 89.73% correct functional labels for German and 90.40% for English validate the general approach.

Similar resources

Appropriation-Based Syllabus and Advanced EFL Learners’ Speaking Skill: The Case of Chunks-on-Card Activities

The impetus for conducting the present study came from Thornbury's (2005) approach to teach speaking in which he claimed that awareness-raising techniques, along with appropriation strategies, facilitate the process of teaching and learning speaking. Therefore, the present study attempted to explore the impact of the appropriation-based syllabus to teach speaking by using chunks-on-card...

Similarity Based Deduplication with Small Data Chunks

Large backup and restore systems may have a petabyte or more of data in their repository. Such systems are often compressed by means of deduplication techniques that partition the input text into chunks and store recurring chunks only once. One of the approaches is to use hashing methods to store fingerprints for each data chunk, detecting identical chunks with very low probability for collisions...

Phrase reordering for statistical machine translation based on predicate-argument structure

In this paper, we describe a novel phrase reordering model based on predicate-argument structure. Our phrase reordering method utilizes a general predicate-argument structure analyzer to reorder source language chunks based on predicate-argument structure. We explicitly model long-distance phrase alignments by reordering arguments and predicates. The reordering approach is applied as a preproces...

A Psychoanalytic Reading of Cyberspace: Problematizing the Digitalization of Oedipus Complex and the Dialectic of Subjectivity and Castration in the Cyberspace

In the present paper, a translational model to psychoanalyze the cyberspace is presented with the argument that cyberspace is a translated version of the human unconscious that projects both our unfulfilled desires and suppressed anxieties. This Freudian-based line of argument is followed by Lacanian (1950s) and Zizekian (2004) psychoanalysis to problematize the digitalization of Oedipus complex and...

TüSBL: A Similarity-Based Chunk Parser for Robust Syntactic Processing

Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. The TüSBL parser extends current chunk parsing techniques by a tree-construction component that extends partial chunk parses to complete tree s...

Publication date: 2001